NSF PAR Search | NSF Public Access Repository

Rule-Enhanced Active Learning for Semi-Automated Weak Supervision

https://doi.org/10.3390/ai3010013

Kartchner, David; Nakajima An, David; Ren, Wendi; Zhang, Chao; Mitchell, Cassie S. (March 2022, AI)

A major bottleneck preventing the extension of deep learning systems to new domains is the prohibitive cost of acquiring sufficient training labels. Alternatives such as weak supervision, active learning, and fine-tuning of pretrained models reduce this burden but still require substantial human input, either to select a highly informative subset of instances or to curate labeling functions. REGAL (Rule-Enhanced Generative Active Learning) is an improved framework for weakly supervised text classification that performs active learning over labeling functions rather than individual instances. REGAL interactively creates high-quality labeling patterns from raw text, enabling a single annotator to accurately label an entire dataset after initialization with three keywords for each class. Experiments demonstrate that REGAL extracts up to 3 times as many high-accuracy labeling functions from text as current state-of-the-art methods for interactive weak supervision, dramatically reducing the annotation burden of writing labeling functions. Statistical analysis reveals that REGAL performs as well as or significantly better than interactive weak supervision on five of six commonly used natural language processing (NLP) baseline datasets.
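To make the notion of a labeling function concrete, the following is a minimal sketch of keyword-based labeling functions seeded with three keywords per class and combined by majority vote. It is not REGAL's method: the class names, the seed keywords, the helper names (`make_keyword_lf`, `weak_label`), and the majority-vote aggregation are all assumptions chosen for illustration, and the active learning over candidate rules that the abstract describes is not shown.

```python
from collections import Counter

ABSTAIN = -1  # conventional "no vote" value; an assumption of this sketch

# Hypothetical seed keywords, three per class, mirroring the
# initialization the abstract mentions. Classes are illustrative only.
SEED_KEYWORDS = {
    0: ["goal", "team", "coach"],      # class 0: sports
    1: ["stock", "market", "profit"],  # class 1: business
}


def make_keyword_lf(keyword, label):
    """Return a labeling function that votes `label` when `keyword` is a token."""
    def lf(text):
        return label if keyword in text.lower().split() else ABSTAIN
    return lf


LABELING_FUNCTIONS = [
    make_keyword_lf(keyword, label)
    for label, keywords in SEED_KEYWORDS.items()
    for keyword in keywords
]


def weak_label(text):
    """Combine votes by simple majority; abstain on no votes or a tie."""
    votes = [v for v in (lf(text) for lf in LABELING_FUNCTIONS) if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN
    return ranked[0][0]
```

A weak-supervision pipeline would typically replace the majority vote with a learned label model and train a classifier on the resulting noisy labels; the sketch only shows how a handful of keyword rules can label text without per-instance annotation.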
Full Text Available
Denoising Multi-Source Weak Supervision for Neural Text Classification

Ren, Wendi; Li, Yinghao; Su, Hanting; Kartchner, David; Mitchell, Cassie; Zhang, Chao (November 2021, Findings of Conference on Empirical Methods in Natural Language Processing)
Full Text Available
ReGAL: Rule-Generative Active Learning for Model-in-the-Loop Weak Supervision

Kartchner, David; Ren, Wendi; Nakajima An, David; Zhang, Chao; Mitchell, Cassie S. (October 2020, Advances in Neural Information Processing Systems)
Full Text Available
Denoising Multi-Source Weak Supervision for Neural Text Classification

Ren, Wendi; Li, Yinghao; Su, Hanting; Kartchner, David; Mitchell, Cassie S.; Zhang, Chao (July 2020, Conference on Empirical Methods in Natural Language Processing)
Full Text Available
